A better coefficient of determination for genetic profile analysis.
نویسندگان
چکیده
Genome-wide association studies have facilitated the construction of risk predictors for disease from multiple Single Nucleotide Polymorphism markers. The ability of such "genetic profiles" to predict outcome is usually quantified in an independent data set. Coefficients of determination (R(2) ) have been a useful measure to quantify the goodness-of-fit of the genetic profile. Various pseudo-R(2) measures for binary responses have been proposed. However, there is no standard or consensus measure because the concept of residual variance is not easily defined on the observed probability scale. Unlike other nongenetic predictors such as environmental exposure, there is prior information on genetic predictors because for most traits there are estimates of the proportion of variation in risk in the population due to all genetic factors, the heritability. It is this useful ability to benchmark that makes the choice of a measure of goodness-of-fit in genetic profiling different from that of nongenetic predictors. In this study, we use a liability threshold model to establish the relationship between the observed probability scale and underlying liability scale in measuring R(2) for binary responses. We show that currently used R(2) measures are difficult to interpret, biased by ascertainment, and not comparable to heritability. We suggest a novel and globally standard measure of R(2) that is interpretable on the liability scale. Furthermore, even when using ascertained case-control studies that are typical in human disease studies, we can obtain an R(2) measure on the liability scale that can be compared directly to heritability.
منابع مشابه
Evaluation of Genetic Variation and Parameters of Fatty Acid Profile in Doubled Haploid Lines of Camelina sativa L.
After cereals, oilseeds are the second-largest food reserves in the world. According to available statistics, more than 95 percent of Iran's oil needs are imported. Given the growing need for edible oils in Iran, it is important to identify fatty acids in the oilseed crops. Camelina sativa L. is an oil-medicinal plant and belongs to the Brassicaceae family that requires very little water and fe...
متن کاملDetermination of genetic uniformity in transgenic cotton plants using DNA markers (RAPD and ISSR) and SDS-PAGE
One concern about using transgenic plants is the genetic variation that occurred from theirs tissue culture and regeneration. Molecular markers are an important element for efficient and effective determination of genetic variation. The present work was carried out to assess the genetic uniformity of transgenic cottons (Bt and chitinase lines), using RAPD, ISSR molecular markers and SDS-PAGE an...
متن کاملAMELX and AMELY Structure and Application for Sex Determination of Iranian Maral deer (Cervus elaphus maral)
In order to have a good perspective of wild animals, it is necessary to determine their population and genetic structure. It provides an opportunity to decide on better conservation managements. In the wilderness, due to the escapable nature and sometimes not having the distinguishable bisexual appearance, sex identification could be difficult by observing animals. The X- and Y- chromosome link...
متن کاملشبیهسازی منحنیهای پسماند رسوب رودخانه صوفی چای در مواقع سیلابی
Information on suspended sediment variation in times of flood is important in management of water resources, particularly management of basins, and in investigation of the causes of erosion. The relationship between discharge and suspended sediment concentration during floods is not similar and homogeneous for different reasons such as precipitation variety, discharge rate and sources of sedime...
متن کاملA New Algorithm for Optimum Voltage and Reactive Power Control for Minimizing Transmission Lines Losses
Reactive power dispatch for voltage profile modification has been of interest Abstract to powerr utilities. Usually local bus voltages can be altered by changing generator voltages, reactive shunts, ULTC transformers and SVCs. Determination of optimum values for control parameters, however, is not simple for modern power system networks. Heuristic and rather intelligent algorithms have to be so...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Genetic epidemiology
دوره 36 3 شماره
صفحات -
تاریخ انتشار 2012